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[0005] In consideration of the aforementioned problem of the 
prior art, therefore, an object of the present invention is 
5 to provide a voice recognition unit supplying a signal of a 
constant level to a subsequent voice recognizer also when the 
level of a voice conversion voltage signal from voice-electric 
conversion means varies with the distance from the conversion 
means or the volume of the voice thereby improving the 
10 recognition rate. 
[0006] 

[Means for Solving the Problem] In order to attain the 
aforementioned object, a voice recognition unit according to 
the present invention comprises voice-electric conversion 

15 means generating a voice conversion voltage signal having a 
magnitude responsive to the level of a voice input, 
amplification means amplifying the voice conversion voltage 
signal from the voice-electric conversion means, gain setting 
means setting the gain of the amplification means, a voice 

20 recognizer receiving the voice conversion voltage signal from 
the said voice-electric conversion means through the said 
amplification means and performing voice recognition, compare 
means comparing the voice conversion voltage signal from the 
said voice-electric conversion means with a predetermined 

25 reference voltage and outputting a set signal when the said 
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voice conversion voltage signal exceeds the reference voltage, 
timer means reset by a reset signal generated by manipulation 
of a speaker for outputting a time signal sustained for a 
constant time by receiving the set signal from the said compare 
5 means in the reset state and peak detection means detecting 
the peak of the said voice conversion voltage signal received 
while the timer means outputs the time signal, and the said 
gain set means sets gain of the said amplification means to 
a value inversely proportional to the peak detected by the said 
10 peak detection means. 
E0007] 

[Function] In the aforementioned structure, the timer means 
is reset by the reset signal generated by manipulation of the 
speaker before speaking. The reset timer means outputs the 

15 time signal sustained for the constant time by receiving the 
set signal output from the compare means when the voice 
conversion voltage signal from the voice-electric conversion 
means responsive to the speaker exceeds the predetermined 
reference voltage, the peak detection means detects the peak 

20 of the currently received voice conversion voltage signal, and 
the gain set means sets the gain of the amplification means 
to the value inversely proportional to the peak detected by 
the peak detection means. As hereinabove described, the 
subsequent signal level is predicted in response to the level 

25 of the peak of the head of the speaker's voice for setting the 
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gain inversely proportional to this peak in the eunplif ication 
means , whereby the gain of the amplification means is rendered 
large for a small voice and rendered small for a large voice 
and the voice conversion voltage signal amplified by the 
5 amplification means to correspond to the subsequent voice is 
regularly supplied to the voice recognizer at a constant level, 
for preventing false recognition resulting from the distance 
between the voice-electric conversion means and the speaker 
or the volume of the voice. 
10 [0008] 

[Embodiment] An embodiment of the voice recognition unit 
according to the present invention is now described with 
reference to the drawings. Fig. 1 is a block diagram showing 
the embodiment of the voice recognition unit according to the 

15 present invention, and numeral 11 denotes an input terminal 
receiving a voice conversion voltage signal (hereinafter 
simply referred to as a voice signal) generated on the basis 
of a voice picked up by a microphone (not shown) serving as 
voice-electric conversion means. 

20 [ 0009 ] Numeral 12 denotes a comparator serving as compare means , 
and the voice signal and a reference voltage Vr obtained by 
performing division with resistors and are applied to first 
and second inputs of the comparator 12 respectively. Numeral 
13 denotes a latch circuit formed by an R-S flip-flop, for 

25 example, and a set signal from an output of the comparator 12 
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and a reset signal from a reset input terminal 13' are input 
in a set input and a reset input thereof respectively. Numeral 
14 denotes a one- shot time circuit triggered by an output of 
the latch circuit 13 for generating a time signal sustained 
5 for a constant time, and numeral 15 denotes a peak hold circuit 
detecting the peak of the voice signal from the input terminal 
11 in the period of the time signal from the one-shot timer 
circuit 14, and holding the level thereof and outputting the 
same. Numeral 16 denotes an analog-digital (A-D) converter 

10 digitizing an analog output from the peak hold circuit 15 and 
outputting the same. Numeral 17 denotes a decoder decoding 
the digitized peak and numerals 18a to 18d denote analog 
switches on-off controlled by an output from the decoder 17, 
while the analog switches 18a to 18d are connected in parallel 

15 with serially connected resistors Ra to Rd. 

[0010] Numeral 19 denotes an amplifier connected to the input 
terminal 11 through the aforementioned parallel circuits of 
the analog switches 18a to 18d and the resistors Ra to Rd, so 
that its output is connected to the voice recognizer 20 . Symbol 

20 R„ denotes feedback resistance of the amplifier 19. 

[0011] The parallel circuits of the resistors Ra to Rd are 
serially inserted in the inputs of the amplifier 19 by 
combination of on-state analog switches 18a to 18d, for 
deciding the input signal level of the amplifier 19 as well 

25 as the gain of the amplifier 19. 
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[0012] It is assumed that a voice signal Sv shown in Fig. 2(a) 
is input in the input terminal 11 in the aforementioned 
structure. The voice signal Sv from the input terminal 11 is 
applied to the comparator 12, and compared with the reference 
5 voltage Vr. When the voice signal Sy exceeds the reference 
voltage at a point tj, a set signal going high from low is 
generated at the output of the comparator 12. and the latch 
circuit 13 latches a high level in response to this set signal 
so that its output goes high from low. The one- shot timer 

10 circuit 14 is triggered due to this rising of the output of 
the latch circuit 13 for outputting a time signal sustaining 
a high level for a constant time up to a point tj as shown in 
Fig. 2(b) on its output and applying the same to the peak hold 
circuit 15. Thus, the latch circuit 13 and the one-shot timer 

15 circuit 14 form timer means reset by the reset signal generated 
by manipulation of the speaker for outputting the time signal 
sustained for a constant time by receiving the set signal from 
the comparator 12 in the reset state. 

[0013] The peak hold circuit 15 detects the peak of the voice 
20 signal Sv while the time signal St is applied, and thereafter 
continuously outputs the peak. In the illustrated example, 
the peak hold circuit 15 outputs a peak Vp detected at the point 
t2. The peak output from the peak hold circuit 15 is digitized 
by the A-D converter 16 and thereafter decoded by the decoder 
25 17 to be converted to a control signal for the analog switches 
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18a to 18d. One of the analog switches 18a to 18d to be turned 
on is decided by the control signal generated by the decoder 
17 , and a proper one of the resistors Ra to Rd is inserted between 
the input of the amplifier 19 and the input terminal 11. 
5 Therefore, the voice signal Sy is input in the amplifier 19 
through the inserted resistor. 

[ 0014 ] When a human speaks the same word, a specific voice signal 
responsive to each speaker is regularly obtained. When a 
subsequent voice signal level is adjusted with reference to 

10 the level of the initial part of this voice signal, therefore, 
the adjusted voice signal level can be rendered substantially 
constant even if the distance between the voice-electric 
conversion means and the speaker or the volume of the voice 
varies, while false recognition hardly results and the 

15 recognition rate is remarkably improved when voice recognition 
is performed with the adjusted voice signal. 
[0015] The speaker can generate the reset signal input in the 
reset input terminal 13' by manipulating a switch before 
speaking, for example, so that the gain of the amplifier 19 

20 is controlled in response to the subsequent voice signal every 
time the latch circuit 13 is reset by application of this reset 
signal . 

[0016] According to the aforementioned unit according to the 
present invention, a substantially identical voice signal is 
25 input in the voice recognizer 20 through the amplifier 19 as 
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shown in Fig. 3(c) whether the voice signal is at a small level 
as shown in Fig. 3(a) or the voice signal is at a large level 
as shown in Fig. 3(b) . When a reference pattern is previously 
registered by a signal corresponding to the voice signal shown 
5 in Fig. 3(c), therefore, it is possible to determine whether 
or not a voice is input by a specific person for generating 
a control signal such as a release signal, for example. However, 
signals in a time T between the points t^ and tj are not adjusted 
but different from each other, and hence this part is ignored 
10 in voice recognition. 
[0017] 

[Effect of the Invention] As hereinabove described, the unit 
according to the present invention sets the timer means reset 
by the reset signal by the manipulation made by the speaker 

15 before speaking with the set signal generated by subsequent 
speaking for generating a constant time and sets the gain 
inversely proportional to the peak of the voice detected in 
the constant time, whereby the voice conversion voltage signal 
input in the voice recognizer after amplification with the set 

20 gain can be regularly set to a constant level so that the result 
of voice recognition does not vary with the distance to the 
speaker or the volume of the voice but voice recognition is 
remarkably improved. 
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(54) METHOD AND DEVICE FOR EDITING VOICE 

(57)Abstract: 

PURPOSE: To provide a voice editing technique capable of monitoring 
an output value of a Codec previous to recording, setting an input level 
in an optimum value by a comparison result with a prescribed value and 
reducing a recorded data amount in accordance with the variation of the 
number of quantization bits of the sampling data In a system coding a 
voice and storing the data in a computer. 

CONSTITUTION: This device is a voice editing device constituted of a 
GPU, a memory, a voice source, an A/D converter, a Codec interface, a 
disk interface, a disk, a speaker and a D/A converter, etc. After a level 
prescribed value and a monitoring time are set first of all. the digital data 
are fetched, and the addition of the data and the counting of the number 
of data are performed for a fixed time (201-204). Then, a mean of input 
values is calculated after the prescribed time is over, and the mean 
value Is compared with the prescribed value, and when the mean value 
is larger, an input level is set in lower, and on the other hand, when the 
mean value is smaller, the input level is set In higher. 
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